Scoring and summarising gene product clusters using the Gene Ontology

Authors

  • Spiridon C. Denaxas
  • Christos Tjortjis
Abstract

We propose an approach for quantifying the biological relatedness between gene products, based on their properties, and measure their similarities using exclusively statistical NLP techniques and Gene Ontology (GO) annotations. We also present a novel similarity figure of merit, based on the vector space model, which assesses gene expression analysis results and scores gene product clusters' biological coherency, making sole use of their annotation terms and textual descriptions. We define query profiles which rapidly detect a gene product cluster's dominant biological properties. Experimental results validate our approach, and illustrate a strong correlation between our coherency score and gene expression patterns.
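
The abstract does not give the scoring formula, so the following is only a minimal sketch of how a vector space model could score a cluster's coherency from annotation terms. It assumes TF-IDF weighting, cosine similarity, and mean pairwise similarity as the cluster score; these choices, the function names, and the toy annotation tokens are all illustrative assumptions, not the authors' method. The `query_profile` helper is likewise a hypothetical stand-in for the query profiles mentioned above.

```python
import math
from collections import Counter
from itertools import combinations


def tfidf_vectors(annotations):
    """Map each gene product to a sparse TF-IDF vector (term -> weight)."""
    n = len(annotations)
    # Document frequency: in how many gene products each term occurs.
    df = Counter(t for terms in annotations.values() for t in set(terms))
    vectors = {}
    for product, terms in annotations.items():
        tf = Counter(terms)
        # Smoothed IDF (an assumption) keeps every weight positive.
        vectors[product] = {
            t: count * (math.log((1 + n) / (1 + df[t])) + 1.0)
            for t, count in tf.items()
        }
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def coherence(cluster, vectors):
    """Illustrative cluster score: mean pairwise cosine similarity."""
    pairs = list(combinations(cluster, 2))
    if not pairs:
        return 0.0
    return sum(cosine(vectors[a], vectors[b]) for a, b in pairs) / len(pairs)


def query_profile(cluster, vectors, k=3):
    """Hypothetical profile: the k terms with the largest summed weight."""
    total = Counter()
    for product in cluster:
        total.update(vectors[product])  # adds weights term by term
    return [term for term, _ in total.most_common(k)]


# Toy annotation tokens, invented for illustration only.
annotations = {
    "gpA": ["apoptosis", "cell_death", "caspase"],
    "gpB": ["apoptosis", "cell_death", "mitochondrion"],
    "gpC": ["translation", "ribosome", "rna_binding"],
    "gpD": ["translation", "ribosome", "cytosol"],
}
vectors = tfidf_vectors(annotations)
tight = coherence(["gpA", "gpB"], vectors)   # products sharing terms
mixed = coherence(["gpA", "gpC"], vectors)   # products sharing no terms
profile = query_profile(["gpA", "gpB"], vectors, k=2)
```

In this toy setting a cluster whose members share annotation terms scores higher than one whose members share none, and the profile surfaces the shared terms. A real pipeline would also need to tokenise the textual descriptions and decide how to treat GO term hierarchy, neither of which is specified here.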

Similar resources

Evaluation of Protein Complexes in Muscular Atrophy Using Interaction Map Analysis

Background and purpose: Muscular atrophy is a condition derived from different diseases and aging. Molecular study of the disease condition can help in developing diagnostic methods and treatment approaches. In this study, a protein interaction network was analyzed to understand molecular events at the protein level. Materials and methods: In this experimental study, the network was constructed and...

Identification and prioritization genes related to Hypercholesterolemia QTLs using gene ontology and protein interaction networks

Gene identification represents the first step to a better understanding of the physiological role of the underlying protein and disease pathways, which in turn serves as a starting point for developing therapeutic interventions. Familial hypercholesterolemia is a hereditary metabolic disorder characterized by high low-density lipoprotein cholesterol levels. Hypercholesterolemia is a quantitativ...

Mapping of TP53 protein network using cytoscape software

TP53 acts as a tumor suppressor in cancer. It induces cell cycle arrest or apoptosis in response to cellular stress and damage. Alteration of the p53 gene can cause uncontrolled cell proliferation. In the present study, we used the TP53 gene as the seed in the construction of a protein-protein functional association network to identify genes that might be involved in the tumorigenesis process with TP53. TP53 prot...

The in Silico Characterization of a Salicylic Acid Analogue Coding Gene Clusters in Selected Pseudomonas Fluorescens Strains

Background: Microbial genome sequences provide a solid in silico framework for interpreting the biosynthetic potential of their drug-like chemical scaffolds. The Pseudomonas fluorescens species is metabolically versatile and produces therapeutically important natural products. Objectives: The main objective of the present study was to mine the publicly available data of P. fluorescens stra...

Molecular identification of phospholipase D as a factor affecting the growth and pathogenicity of microorganisms

Background and Objective: Secretory extracellular Phospholipases are generally involved in hydrolysis of extracellular phospholipids and thus providing nutritive source of carbon, nitrogen, and phosphate. However, intracellular phospholipases perform metabolic functions and adjust biologic activities. Synthesis of phospholipases in different pathogenic microorganisms and their mode of action in...

Journal:
  • International Journal of Data Mining and Bioinformatics

Volume 2, issue 3

Pages: -

Publication date: 2008